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Abstract 

Write T for the set of homomorphisms from {0, l} d to Z which send 
to (think of members of F as labellings of {0, l} d in which adjacent 
strings get labels differing by exactly 1), and Ti for those which take 
on exactly i values. We give asymptotic formulae for \T\ and 

In particular, we show that the probability that a uniformly chosen 
member f of T takes more than five values tends to as d — > oo. This 
settles a conjecture of J. Kahn. Previously, Kahn had shown that there 
is a constant b such that f a.s. takes at most b values. This in turn 
verified a conjecture of I. Benjamini et al., that for each t > 0, f a.s. 
takes at most td values. 

Determining | F\ is equivalent both to counting the number of rank 
functions on the Boolean lattice 2^ (functions /: 2^ — > N satisfying 
/(0) = and f(A) < f(A U x) < f(A) + 1 for all A G 2^ and x G [d]) 
and to counting the number of proper 3-colourings of the discrete cube 
(i.e., the number of homomorphisms from {0, l} d to K3, the complete 
graph on 3 vertices). 

Our proof uses the main lemma from Kahn's proof of constant 
range, together with some combinatorial approximation techniques in- 
troduced by A. Sapozhenko. 



Key words and phrases: graph homomorphism, Hamming cube, rank function, graph 
colouring. 
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1 Introduction 



1.1 Background and statement of the result 

Write Qd for the d-dimensional Hamming cube (the graph whose vertex set 
is {0, 1} and in which two vertices are joined by an edge if they differ in 
exactly one coordinate). Set 



(That is, T is the set of graph homomorphisms from Qd to Z, normalized 
to vanish at 0.) 

In [2], this set of functions is studied from a probabilistic point of view, 
a motivating idea being that a typical element of J- should exhibit stronger 
concentration behavior than an arbitrary element. Put uniform probability 
measure on T, and define the function R on T by R(f) = {f{v): v E V{Qd)} 
[R is the range of /). In [2] the following conjecture is made about the 
concentration of \R\: 

Conjecture 1.1 For each t > 0, P(|-R| > td) — > as d — > oo. 

In [7], something stronger is proved, and something stronger still conjec- 
tured: 

Theorem 1.2 There is a constant b such that ~P(\R\ > b) = e~ n ( d \ 

Conjecture 1.3 P(|i?| > 5) = e _n W and P(\R\ = 5) = 

In this paper we prove Conjecture 11.31 by (asymptotically) counting the 
number of homomorphisms with various ranges. Specifically, if we set 



T = {/: V(Q d ) Z: /(0) = 0andii~^ \f{u) - f(v)\ = 1}. 



Ti = {feT:\R(f)\=i}, 



we prove 



Theorem 1.4 



1^3 1 
1^5 1 



T\ 



(2e±e- n ^)2 2d - 1 
(2±e- n ^)2 2d - 1 
(4^-4±e-^))2 2d " 
(2e- 4^ + 2±e- Q(d) )2 2 ' 



,d-l 
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which gives Conjecture [L3j Setting T<b = Uj^sJ^, we see that Theorem 1 1.41 
has the following weaker but more elegantly formulated consequence: 

Corollary 1.5 \T\ ~ |J"< 5 | ~ 2e2 2d ~\ 

Corollary II . 51 makes sense: a little thought suggests that a typical mem- 
ber of J- should be constant on either even or odd vertices of the cube, 
except for a small set of "blemishes" on which it takes values 2 away from 
the predominant value, and take just two values on vertices of the other 
parity. 

The problem under discussion is equivalent to the question of the number 
of rank functions on the Boolean lattice 2^ (here [d] = {1, . . . , d}). A rank 
function is an /:2^ — > N satisfying /(0) = and f(A) < f(A U x) < 
f{A) + 1 for all A € 2^ and x G [d]. An easy lower bound on the number 
of rank functions is 2 2d 1 (consider those functions which take the value 
k/2 on each element of the kth level of the Boolean lattice for each even 
k). Athanasiadis [lj conjectured that the total number of rank functions is 
2 2d This conjecture is proved in [8], where it is further conjectured 

that the number is in fact C>(2 2 ). Theorem O answers this conjecture 
in the affirmative; for, as observed by Mossel (see [7]), there is a bijection 
from the set of rank functions to J-: identifying a subset A of [d] with a 
vertex of Qd in the natural way, the bijection is given by g — > f where 
f(A) = 2g(A)-\A\. 

Theorem 11.41 also provides information about the number of proper 3- 
colourings of Qd- A proper 3-colouring of a graph G with vertex set V and 
edge set E is a function %: V — > {0, 1,2} satisfying (x,y) £ E => x( x ) 7^ 
x{y). Theorem 11.41 implies that the number of proper 3-colourings of Qd is 
asymptotic to 6e2 2d l ; for, as observed by Randall [13], there is a bijection 
from T to the set of proper 3-colourings of Qd with %(0) = 0: the bijection 
is given by / — > x where x( v ) = i iff f(v) = i (mod 3). 

The main inspiration for the proof of Theorem 11.41 is the work of A. 
Sapozhenko, who, in |15| . gave a relatively simple derivation for the asymp- 
totics of the number of independent sets in Qd (earlier derived in a more 
involved way in [TTJ ) . Our Lemma 17.21 is a modification of a lemma in |14j , 
and our overall approach is similar to [15]. The other key ingredient in our 
proof is the main lemma from [7] , which was already used by Kahn to give 
Theorem 11.21 

In the rest of this section, we establish basic notation and gather together 
the main external ingredients that will be used in the proof of Theorem ll.41 
before giving an outline of the rest of the paper. 
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1.2 Notation and conventions 

For graph theory basics, see e.g. [3], [5]. For basics of the combinatorics of 
the Hamming cube, see e.g. [3]. 

The Hamming cube Qd is a (i-regular, bipartite graph. Write V for the 
vertex set of the cube, £ for the set of even vertices (those whose l\ distance 
from is even) and O for the set of odd vertices. Set M = 2 d ~ l = \£\ = \0\. 

For u, v E V and A,C C V we write u ~ v if there is an edge in 
joining u and v , V(vl) for the set of edges having exactly one end in A and 
(when An C = 0) V(A,C) for the set of edges having one end in each of 
A,C. 

Set N(u) = {w G V: w ~ u} (N(u) is the neighbourhood of u), 
N(A) = U w&A N(w), N c (u) = {weC:w~ u}, N C {A) = U weA N c (w), and 
dc(u) = \Nc(u)\. Write p(u, v) for the length of the shortest u-v path in Qd, 
and set p(u, A) = mm w( z a{p(u,w)} and p(A, C) = mm weA:W / & c{p(w,w')}. 
Set B{A) = {v G V: N(v) C A}. 

We say that A is A;-linked if for every u,v £ A there is a sequence 
■u = no, u±, . . . , ui = v in A with p{ui,Ui + \) < k for « = 0, . . . , I — 1. Note 
that for any /c, ^4 is the disjoint union of its maximal /c-linked subsets — we 
call these the ^-components of A. Write C -< A if C is a 2-component of 
^4, and c(A) for the number of 2-components of A. 

We say that A is small if \A\ < a d for a certain constant a < 2 that 
will be discussed in Section [2] (and large otherwise), sparse if all the 2- 
components of A are singletons (and non-sparse otherwise), and nice if A 
is small, 2-linked and of size at least 2. Note that all sets A that we will 
consider will satisfy either A C £ or A C Q. 

For integers a < b we define [a, 6] = {a, . . . , b}. 

We use "In" for the natural logarithm and "log" always means the base 
2 logarithm. The implied constants in the O and SI notation are absolute 
(independent of d). We always assume that d is large enough to support our 
assertions. No attempt has been made to optimize constants. 

1.3 External ingredients 

We list here the main results that we will be drawing on in the rest of the 
paper. 

We begin with a lemma bounding the number of connected subgraphs of 
a graph. The infinite A-branching rooted tree contains precisely (^ n ) /((A — 
l)n+ 1) rooted subtrees with n vertices (see e.g. Exercise 11 (p. 396) of [9]) 
and this implies that if G is a graph with maximum degree A and vertex set 
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V(G) then the number of n-vertex subsets of V(G) which contain a fixed 
vertex and induce a connected subgraph is at most (eA) n . (This fact is 
rediscovered in [2].) We will use the following easy corollary. 

Lemma 1.6 Let E be a graph with vertex set V(S) and maximum degree 
A. For each fixed k, the number of k-linked subsets ofV{Ti) of size n which 
contain a fixed vertex is at most 2°( nlo s A ). 

This follows from the fact that a A:-linked subset of £ is connected in a graph 
with all degrees 0(A fc+1 ). 

The next lemma is a special case of a fundamental result due to Lovasz 
|12j and Stein |16j (see also [6]). For a bipartite graph £ with bipartition 
X L)Y, say Y' C Y covers X if each x £ X has a neighbour in Y' . 

Lemma 1.7 If a bipartite graph £ with bipartition XUY satisfies d(x) > a 
for all x € X and d(y) < 6 /or a// j£r", f/ien X is covered by some Y' C Y 
of size at most (\Y\/a)(l + lno). 

The next lemma is from [14] (see Lemma 2.1); the reader should have 
no difficulty supplying a proof. 

Lemma 1.8 IfT, is a graph on vertex set V(S) and i,CC V(S) satisfy 

(i) A is k-linked 
and 

(ii) /?(«, C) < / /or eac/i aGi and p(-u, A) < I for each v € C , 
then C is (k + 2l)-linked. 

The main step from the proof of Theorem 11.21 in [7J (obtained via en- 
tropy arguments) will also be used here. For / G J 7 , set C(/) = {t> G 
^ : /lAf(i>) is constant}. 

Lemma 1.9 For u ~ i> and f drawn uniformly from J- , P(|{it, u} D C(f)| = 
l) = l_ e -n(«0. 

Finally, we need to know something about isoperimetry in the cube. A 
Hamming ball centered at xo in is any set of vertices B satisfying 

{u £ V: p(u, x ) < k} C B C {u £ V: p(u, x ) < k + 1} 

for some k < d. An even (resp. odd) Hamming ball is a set of vertices 
of the form B n £ (resp. B D O) for some Hamming ball B. We use the 
following result of Korner and Wei [10] . 
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Lemma 1.10 For every C C £ (resp. O) and D C V , there exists an even 
(resp. odd) Hamming ball C and a set D' such that \C'\ = \C\, \D'\ = \D\ 
and p(C',D')> p(C,D). 

I. 4 Outline 

The rest of the paper is organized as follows. 

In Section [2] we use Lemma 11.91 to reduce Theorem 11.41 to the problem 
of counting the number of homomorphisms which are predominantly on 
£. The easy lower bounds on the number of homomorphisms which take on 
four and five values are given in Section [3l In Section [5] we examine a general 
type of sum over small subsets of £ and establish some of its properties. In 
Section [5] we write down an explicit sum of the type examined in Section U 
for the number of homomorphisms which are predominantly on £. The 
rest of the paper is devoted to estimating this sum. In Section[6]we establish 
lower bounds on the sizes of neighbourhoods of single-parity sets in the cube. 
In Section [7] we arrive at the heart of the matter, showing that the set of nice 
subsets of £ can be "well-approximated" in a precise sense by members of a 
"small" collection; this allows us to swiftly complete the proof of Theorem 

II. 41 in Section [U We postpone a more detailed outline of the latter portion of 
the argument until the beginning of Section[71 Finally, in Section[9l we make 
some brief remarks on the proof and possible extensions of the techniques 
used. 

2 Reduction to mostly constant 

We begin the proof of Theorem 1 1.41 by using Lemma ll.9l to reduce the prob- 
lem to that of counting homomorphisms which mainly take a single value 
on £. 

There is an inherent odd-even symmetry in the problem; we now refor- 
mulate slightly to make use of this. Write 



For each f £ A write [/] for the equivalence class of / in B. Noting that R 
is constant on equivalence classes, we may define 



A = {f:V ^Z:u~v=> \f(u)-f(v)\ = 1} 



and write B for the quotient of A by the equivalence relation 




B i = {[f]eB:\R(f)\=i}. 
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Clearly \Bi\ = \Fi\ for each i {F is a complete set of representatives for B). 

For f £ A, we say that / is mostly constant on £ if there is some 
c such that {t> G £: 7^ c} is small (see Section 11.21 for the definition 
of small; the constant a in that definition will be specified in the proof of 
Lemma I2.2|) . and we define mostly constant on O analogously. These 
definitions respect the equivalence relation, so we may define 

B £ = {[/] G B: f is mostly constant on £}. 

Define B° analogously. By symmetry, \B £ \ = \B \ (any automorphism of 
Qd that sends £ to O induces a bijection between the two sets). 

Lemma 2.1 

\B £ nB°\=e~ n ^\B\. 

Proof: To specify an [/] G B D B° we first specify the predominant values 
of the representative / on £ and O. W.l.o.g. we may assume that the 
predominant value on £ is 0, and so the predominant value on O is one 
of ±1. We then specify the small sets from £ and O on which / does not 
take the predominant values, and finally the values of / on these small sets. 
Noting that once f(v) has been specified for any v G V there are most 2d+l 
values that / can take on any other vertex and that 2 M is a trivial lower 
bound on \B\, we get 

e-°W)|B|. 



B £ nB°\ < 

< 



Lemma 2.2 

\B\ = (2±e- n ^)\B £ \. 

Proof: For / G A, set C(f ) = {v G V: / 1 at(v) i s constant} (extending the 
definition given in Section f 1 . 3 j) . We choose a uniform member [f] of B by 
choosing f uniformly from T . For [f] and u, v G V, let Q u be the event {u G 
C(f)}, Qu the complementary event, Q u ^ = Q u H an d = Qu H Q,j. 
Write iC n = if«(f) for the set of vertices that can be reached from it in C(f) 
via steps of size exactly 2, and let Q*^ be the event {v G K u }. (Note that 
if /, g G A are equivalent then C(f) = C(g), so all these events are well 
defined.) 
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Let u and v be two vertices of the same parity. We claim that U Q^ v 
occurs with probability 1 — e~^ d \ For, let ua\a2 ■ ■ ■ (i2k-i v be a u-v path 
of length at most d (the diameter of Qd)- Writing ao for u and a>2k for v, we 
have 

U Q uv D D i=0 (Q ai a^r[ U Qa^a i+1 )- 

By LemmattH P(Q ai a^U<2ar ai+1 ) = 1 - e - n ^ for each i. Hence P(Q^U 
<&,) > 1 - de~ n W = 1 - e~ n W>, as claimed. 

We therefore have, for fixed u G V and any u of the same parity as u, 
V{Q* UV \Q U ) > 1 — c~ d , where c > 1 is fixed. So, conditioning on Q u , we have 

E(|{«:p(«,t;) even, v # tt }|) < (2/c) d , 

so that, by Markov's Inequality (with the constant d chosen so that 2/c < 
c'<2), 

P(|if„| < M — (c') d \Q u ) < (2/cc') d = e~^ d \ (1) 

If u £ C(f), then K u (f) = 0, so that P(|iT u | < M — (d) d \Qu) = 1. 
By symmetry, P((5uu) is the same for every adjacent u and u, and this 
together with Lemma O gives 1/2 + e"^ > P(Q U ), P(Qu) > l/2-e~ n ^. 
Combining these observations with ([I]), we get 

Pfll^l < M — (c') d ) < 1/2 + e- n{d \ 

Noting that f is constant on the neighbourhood of K u , this says (taking u 
to be any vertex in O) that there is a constant f3 < 2 such that 

P(f is constant on a subset of £ of size at least M - (3 d ) > 1/2 - e~^ d \ 

Taking a = f3 in the definition of small, this says 

18*1 > {1/2 - e- n(d) )\B\. 

The lemma now follows from Lemma l2.1i 

I 

It is now convenient to choose as a complete set of representatives for 
B the collection 

jr £ = {/ G A:£ \ / _1 (0) is small}. 

Set 

T? = {feT £ :\R(f)\=i}. 
Noting that |jf | > 2 M , we see that Theorem 11.41 will now follow from 
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Theorem 2.3 

\F £ \ < {e + e- nW )2 M (2) 

> (2^-2-e- n(d) )2 M (3) 

|jf| > (e-2^+l-e" n(d) )2 M . (4) 

It is this that we proceed to prove. 

3 Lower bounds on | and \T^\ 

The aim of this section is to prove ([3]) and (jU) . 

With each sparse A C £ of size at least 2 we associate a subset J-^(A) C 
J-f of size 

( 2 \A\ _ 2 )2 M - d \ A \ = 2 M M-\ A \l - 2~I A I +1 ) 

consisting of those / 6 .Fjf for which = [—2, 2] and i f~ 1 ({±2}) = A (on 
A, choose values for / from {±2}, choosing at least one 2 and at least one 
—2; on £ \ A give / value 0; and onO\ N(A) choose values from {±1}, all 
choices made independently). Then J 7 ^ (A) n J-jf (B) = whenever A ^ B. 
Noting that there are at least - Md 2 (^J 2 2 ) sparse subsets of £ of size 
k, and that for k < d, this number is (1 — e _fi ( d ))(^), we can lower bound 

\?$\ by 
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> 2 M ^ \{A C £: A sparse, |A| = A;}|M- fc (l - 2" 



> 2 M (l-e-^))^ (^|M- fc (l-2- fe+1 ) 



k>2 

d 

k 



k=2 

> 2 M (1 - e-^ d) ) - e-^)(l/fc!)(l - 2~ k+1 ) 

k=2 

d d 

> 2 M (l-e- Q( - d) )(Y / l/kl-2Y / 2~ k /kl) 

k=2 k=2 

> 2 M (l-e- Q(d) )((e-2) -2(y^-3/2)) 

> 2 M (e-2^+l-e-^), 

so we have dU). 

We do something similar for ([3]). With each nonempty, sparse A C £ we 
associate a subset .F4 (^4) C .7-4 of size 

2 l+M-d\A\ _ 2 M M -\A\ 2 -\A\+1 
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consisting of those / G Ff for which either R(f) = [—2, 1] or R(f) = [— 1, 2] 
and /~ 1 ({±2}) = A (choose a value from ±2 for / to take on A; on £ \ A 
give / value 0; and choose values from ±1 on O \ N(A), all choices made 
independently). So we have 



> 2 M E \i A ^£-A sparse, \A\ = k}\M~ k 2- 

fc>l 

> 2 M {2^~e-2-e-^). 



k+l 



4 Sums over small subsets of 8 



In this section, we examine a certain kind of sum that will arise when we 
try to write down an explicit expression for \J- £ \. Specifically, we prove 

Lemma 4.1 Suppose that g: 2 £ — > R + satisfies 

g(A) = H{g(A i ):A i -<A}, (5) 

g({y}) = c2~ d for all y G £ for some constant c > (6) 



and 



Then for all D C £ 



E fl(A)=e-°M. 
A nice 



(7) 



E g{A)-{l + c2- d )\ D \ 
AC D, A small 



-n(d) 



Remark: Because -< 0, any g satisfying l^j must also satisfy g[fh) = 1. 



Proof of Lemma \4-l\ ' All summations below are restricted to subsets of D. 
We begin by observing that (1 + c2~ d )^ = £ A c^2~ d ^ and that if A is 
sparse then g(A) = cl A l2- d l A l, so that 



E g(A)-{l + c2- d )\ D \ 
A small 



< YjgiA) + E"c |A| 2- d|A| + Yi"c\ A \2~ d \ A \ 



(8) 

where Y! is over A small and non-sparse, J2" is over A large and Y?" is over 
A non-sparse. 
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We bound each of the terms on the right-hand side of ([8]). For the first 
we have 

£'<7(A) < £ {g(A')g(A' \ A): A' nice, A small, A' -< A} 

< £ 9(A') £ 5(^4) 

A' nice A small 
= £ (9) 

^4 small 

For the second we have 

yv/ c |A| 2 -d]A| < V- C [A| 2 -«*|A| 

|A|>d 
1^1 



- „-n(d) 



Finally, for the third we have 



Combining ©, JTDD and dTTJ) we get 



£ - (1 + c2- rf )l^l 

A small 



-f2(cQ 



(10) 



x,x'£D, p(x,x')=2 A 

< \D\c 2 d 2 2- 2d (l + c2- d )W 

= e~ n ^. (11) 



e -n(«0 ( £ g(A) + ! (12) 
v,.A small / 

(13) 



(We get (fT3"|) from (fT2j) because the latter implies that X)^4 small sC^) * s 
bounded.) 



The most important g that we will be considering is 

g(A) = 2~l iV ( A )l+l s ( A )l 
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(recall that B(A) = {v G N(A):N(v) C A}). It's easy to see that this 
satisfies © and ([6]) (with c = 1). It is far from obvious that it satisfies ([7]); 
Sections [7] and [8] are devoted to the proof of this fact, which we state now 
for use in Section 



Theorem 4.2 



J2 2 -\N(A)\+\B(A)\ =e -fi(d)_ 
ACf nice 



5 Proof of (|2j) 

In this section, we write an explicit sum of the type introduced in Section U 
for \J- \ and use Lemma 14. II to estimate it, modulo Theorem 14.21 This will 
give ©. 

For each small A C E, set 

T £ (A) = {f €T £ :f- 1 (0)=£\A}. 

We may specify an / G J- (A) by the following procedure. First, noting 
that / must be either always positive or always negative on a 2-component 
of A, we specify a sign (±) for each such 2-component. Next, we specify a 
nested sequence 

4 = C2 5 C4 3 . . . D C 2 [d/2] ■ 

For each i = 1, . . . , [d/2], C 2 j = {u G £: |/(it)| > 2i}, Because the diameter 
of Qd is d, we have \f(u)\ < 2 [d/2] for all u G £ , so this second step completes 
the specification of / on £. Note that not every sequence of C2«'s gives rise 
to a legitimate / G J 7 ^. 

To specify / on O, we first specify a value from ±1 on each vertex of 
O \ N(A), and then, for each i = 1, . . . , [d/2], specify a value from 2i ± 1 for 
I /(it) I for each u G B(C2i) \ N(C2i+2) (note that the sign of f(u) for such 
u has been determined by the specification of signs on A). To see that this 
completes the specification of / on O, note that we have a choice for the 
value of l/l at u G N(A) iff / is constant on N(u) iff u G £(C 2 i) \ iV(C2i+2) 
for some 1 < i < [d/2] (setting C2[^/2]+2 = 0)i an d that in this case we can 
choose from two possible values, 2i ± 1 (see Figured]). 

So, noting that N(C2i+2) Q B(C2i) for each i = 1, . . . , [cZ/2] , we have 

[d/2] 

I J* (A) I = 2 c{A)+M -\ N ( A) \ + \ B ( A) \ V 2-l Ar ( c ' 2l )l + l B ( c ' 2 ')l 

i=2 
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Figure 1: A vertex in N(A) \ B{A) has neighbours in both £ \ A and A, 
and a vertex in N(C4) \ B{C^) has neighbours in both A \ C4 and C4, but 
a vertex in B{A) \ N{Ci) only has neighbours in A \ C4. 



where the sum — here and in the next line — is over all legitimate choices 
of C 2 2 ••• 2 C 2 [d/2}- Setting 

[d/2] 
i=2 

we get 

1^1 = 2 M h ( A )- 

A C £ small 

We claim that h satisfies all the conditions of Lemma 14.11 For A = {y} 
we have B(A) = 0, and so h(A) = 2 1 ~ d ; this gives © (with c = 2). To 
see that h satisfies ©, note that for each A C. £ small, each Cn is a small 
subset of A, and so we can crudely upper bound h(A) by 

/ \ [d/2] 

h(A) < 2 C ( A H^)I+|5(A)| J 2 -|jv(c)|+|b(c)| I 

\C C A small / 

< 2 c(A)-[iV(A)|+|B(A)| + 2 -d)« d + [d/2] (14) 

< (1 + (l))2 c(j4H7V(A)l+|B(A)l . 



The inequality in (|14p is obtained by applying Lemma [4.1 1 and Theorem 14.21 
and ([7]) for h now follows directly from Theorem 14.21 Finally, to establish 
([5]) for h, note that C2 2 C4 2 • • • 2 C2U/2] is a legitimate sequence of C"s 
for A iff C2 n D C4 n 2 • • • 2 C2W/2] n Aj is a legitimate sequence for 
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Ai for each 2-component A4 of A, from which the claimed factorization of 
h(A) follows. 

We can now easily establish ([2]) , thus completing the proofs of Theorems 
12.31 and II. 41 Applying Lemma 14.11 we have 



e2 



M 



< 2 M (| £>(A) - (1 - 2~ d+1 )^| + |(1 - 2- d+1 f\ - e|) 
_ e -n(d) 2 M 



where is over AC. £ small. 



6 Isoperimetry in the cube 

The aim of this section is to put some lower bounds on the neighbourhood 
size of a small set in Qd- We begin with 

Lemma 6.1 For all AC 8 or ACQ small, \A\ < (1 - Vt{l))\N{A)\. 

Proof: By symmetry, we need only prove this when 4Cf. Let small A C £ 
be given. Applying Lemma [LlUl with C = A and D = V \ (A U N(A)), 
we find that there exists an even Hamming ball A' with \A'\ = \A\ and 
|i\T(^4) I > |iV(A')|. So we may assume that A is a small even Hamming ball. 

We consider only the case where A is centered at an even vertex, w.l.o.g. 
0, the other case being similar. In this case, 

{v e £:p(v,0) < k} C A C {v G £:p(v,0) <k + 2} 

for some even k < d/2 — f2(d) (the bound on coming from the fact that 
A is small). For each < i < (k + 2)/2, set Bi = An {v:p(v,0) = 2i}, 
and N+(Bi) = N(B{) n {u:p(ti,0) = 2i + 1}. It's clear that N(A) = 
Uo<i<(k+2)/2N + (Bi) and that for i = 0, . . . , (k + 2)/2 



|JV+(S<)| - d-2t v ; 

= 1-^(1), (16) 

from which the lemma follows. The inequality in (|16p comes from the bound 
on k. The inequality in (|15|) is actually an equality except when i = (k+2)/2, 
in which case it follows from the observation that each vertex in Bk+2 has 
exactly d — (k + 2) neighbours in N + {B}.^)-, and each vertex in N + (Bf :+ 2) 
has at most {k + 2) + 1 neighbours in B^+2- 
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Lemma 16. 1 1 is true for all small A, but can be strengthened considerably 
when we impose stronger bounds on \ A\. In this direction, we only need the 
simple 

Lemma 6.2 If \A\ < d° (1 \ then \A\ < 0(l/d)\N(A)\, and if \A\ < d/2, 
then\N(A)\>d\A\-2\A\(\A\-l). 

Remark: Note that the second statement is true for all A, but vacuously so 
for \A\ > d/2. 

Proof of Lemma \6. 2\: If \A\ < d°^ l \ then we have k = 0(1) in the notation 
of Lemma I6.1| and repeating the argument of that lemma we get \A\ < 
0(l/d)\N(A)\. 

For the second part, note that each u £ A has d neighbours, of which at 
least d — 2(\A\ — 1) must be unique to it, since a pair of vertices in the cube 
can have at most two common neighbours. 

I 

From here on, the only properties of the cube that we will use are the 
isoperimetric bounds of Lemmas 16,11 and 16.21 

7 The main approximation 

We now begin the proof of Theorem 14.21 The approach will be to partition 
the set of A's over which we are summing according to the sizes of A, N(A), 
B{A) and N(B{A)) (note that the summand in Theorem 14.21 is constant on 
each partition class). The bulk of the work will be in bounding the sizes of 
the partition classes. 

Given A C £, set G = G(A) = N{A), B = B(A) and H = H(A) = 
N(B). In what follows, G, B and H are always understood to be G(A), 
B{A) and H(A) for whatever A is under discussion. Note that B C G and 
H c A. 

Given a, g, b and h, set 

H(a, g, b,h) = {AC£ 2-linked : \A\ = a, \G\ = g, \B\ = b and \H\ = h} . 
The aim of this section is to prove 
Lemma 7.1 For each a, g, b and h with a < a d , 

\H(a,g,b,h)\ <M 2 9 - h '^ 9 /^ d \ 
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from which we will easily derive Theorem 14.21 in Section [8j 

From now until the beginning of Section [8j a,g,b and h are fixed, and 
we write H for H(a, g, b, h). The proof of Lemma 17.11 involves the idea 
of "approximation". We begin with an informal outline. To bound \H\, we 
produce a small set U with the properties that each A 6 H is "approximated" 
(in an appropriate sense) by some U GW, and for each U £U, the number of 
A £ H that could possibly be "approximated" by U is small. (Each U £ U 
will consist of four parts; one each approximating G, A, H and B.) The 
product of the bound on \U\ and the bound on the number of A £ % that 
may be approximated by any U is then a bound on \T-L\. Another way of 
saying this is that we produce a set U and a map app:H — > U\ we then 
bound \H\ by 

\U\ < \U\m^\app~\U)\. 

The set U is itself produced by an approximation process — we first produce 
a small set V with the property that each A £ % is "weakly approximated" 
(in an appropriate sense) by some V £ V, and then show that for each V 
there is a small set W(V) with the property that for each A £ % that is 
"weakly approximated" by V, there is a W £ W(V) which approximates A; 
we then take U = LVevVV(y). (Each V £ V will consist of two parts; one 
each approximating G and H.) 

We now begin the formal discussion of Lemma 17.11 by introducing the 
two notions of approximation that we will use, beginning with the weaker 
notion. A covering approximation for A C £ is a pair (F',P') £ 2° x 2 e 
satisfying 

F' C G, N(F') D A (17) 

and 

P' C H, N(P') D B 

(see Figure [2]). An approximating quadruple for A C £ is a quadruple 
(F, S, P, Q) £ 2° x 2 £ x 2 s x 2° satisfying 

FQG, S^A, (18) 

d F (u) > d-Vd for all u £ S (19) 

d £ \ s (v) > d-Vd for all v £ O \ F (20) 

P^H, Q^B, (21) 

d P (u) > d-Vd for all -u G Q (22) 
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and 

do\q(v) >d-Vd for all v G £ \ P (23) 

(see Figure [3|). Note that if x is in A then all of its neighbours are in G, and 
if y is in O \ G then all of its neighbours are in £ \ A. If we think of S as 
"approximate A" and F as "approximate G" , (|19f) says that if x G £ is in 
"approximate A" then almost all of its neighbours are in "approximate G" , 
while (|20|) says that if y G is not in "approximate G" then almost all of its 
neighbours are not in "approximate A" , and there are similar interpretations 
for (|22|) and (j23l). 



G (=N(A)) 



1 [i 

Figure 2: F' satisfies both the conditions of (|17|) . 








Figure 3: The pair (F, S) satisfies (pi]). To satisfy CED and ([20D, each vertex 
u € S should have most (all but ^fd) of its neighbours in F, and each vertex 
v G O \ F should have most of its neighbours in £ \ S. 

There are two parts to the proof of Lemma I7.lt the "approximation" 
step (Lemma 17. 2[) and the "reconstruction" step (Lemma 17. 3p . We now 
state these two lemmas (from which Lemma 17. II follows immediately). 
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Lemma 7.2 There is a family 

U = U{a, g, b, h) C 2° x 2 s x 2 £ x 2° 

with 

\U\ < M2° (9logd/v ^ ) 
suc/i that every A £ "H /ias an approximating quadruple in IA . 

Lemma 7.3 For each (F, S, P, Q) € 2° x 2 £ x 2 £ x 2° satisfying HE), 
[Ml and $23$), there are at most 2S-*>-n(s/iogd) A> sin j{ satisfying and 

Lemma 17.21 follows directly from the next two lemmas. 
Lemma 7.4 There is a family 

V = V(a,g,b,h) C 2° x 2 s 

with 

I VI < M2°^ log2d// ^ 
smc/i iracrf eac/i A £ T~L has a covering approximation in V. 

Lemma 7.5 For each (F',P') £ 2° x 2 £ there is a family 

W = W(F', P', a, g, b, h) C 2° x 2 £ x 2 £ x 2° 

|W| < 2°^ logd// ^ 

suc/i that any A £ H for which (F',P') is a covering approximation has an 
approximating quadruple in W. 

We prove Lemmas 17.41 and 17.51 in Section T7. 11 We then prove Lemma l7.3l 
in Section 17.21 The main point in the proof of Lemma 17.51 is an algorithm 
which produces approximating quadruples from covering approximations; 
the idea for this algorithm is from |14j . 
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7.1 Proofs of Lemmas 17.41 and 17. 5t Approximations 



We begin with a simple observation about sums of binomial coefficients 
which we will draw on repeatedly (and usually without comment) in this 
section and the next. If k = o(n), we have 

§(") £ (1 + 0(W) t) 

< (1 + 0(k/n))(en/k) k 

< 2 (1+0 ( 1 )) Uog ( n / fc ). (24) 

Proof of Lemma \7.4\ ' For each A E T~L we obtain a covering approximation 
for A by taking F'(A) C G to be a cover of minimum size of A in the graph 
induced by G U A and P'{A) C to be a cover of minimum size of P in the 
graph induced by HUB. Note that P'(^) C N(F'(A)). 

By LemmaOl F'(A) is 4-linked (A is 2-linked, p(x, F'(A)) = 1 for each 
x £ A and p(y,^4) = 1 for each y E F'(A)). By Lemma [L7l < 
p(l + lnd)/d = 0(5 log d/d) and < + In d)/d = 0{g\ogd/d) 

(noting that h < g). 

We may therefore take V to be the set of all pairs (F', P') E 2° x2 £ with 
F' 4-linked and P' C iV(F'), and F',P' both of size at most 0(g log d/d). 
By Lemma 11.61 there are at most 

M ^2 2°^ log,i) = M2 0{ - 9l °^ 2 d / d) 

i<0(glogd/d) 

possibilities for F' (the factor of M is for the choice of a fixed vertex in F'), 
and, given F' , a further 

i<0(g log d/d) V Z / 

choices for P' (here we are using (I24p and the fact that |JV(P')| < d(7). The 
lemma follows. 



Proof of Lemma \ 7.fy Fix A C We give an algorithm which, for input 
(P',5') E 2° x 2 £ satisfying F' C G and S" 5 -4 produces an output 
(P,5) E 2° x 2 f satisfying (fT8|). (fT9|) and (pO|). 

Fix a linear ordering <C of V. 
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Step 1: If {u G A: d,Q\ pi (it) > Vd} / 0, pick the smallest (with respect 
to <C) u in this set and update F' by F' { — F' U N(u). Repeat this until 
{u G A:d G \ F ,{u) > Vd} = 0. Then set F" = F' and S" = S' \ {u G 
£: do\pn(u) > Vd} and go to Step 2. 

Step 2: If {w G O \ G: ds»(w) > Vd} / 0, pick the smallest (with respect 
to <C) w in this set and update S" by S" < — S" \ N(w). Repeat this until 
{w G O \ G:d s »{w) > Vd} = 0. Then set S = S" and F = F" U {w G 
0:ds(w) > Vd} and stop. 

Claim 7.6 The output of this algorithm satisfies \18\). \19\) and Ii20\) . 

Proof: To see that F C G and S 3 A, first observe that F" C G (since 
F' C G, and the vertices added to F' in Step 1 are all in G) and that 
S" 5 A (or Step 1 would not have terminated). We then have S 5 A since 
Step 2 deletes from S" only neighbours of O \ G, and FOG since the 
vertices added to F" at the end of Step 2 are all in G (or Step 2 would not 
have terminated). 

To verify CED and ([20]), note that d F «(«) > d — Vd for all u G 5"' by 
definition, S C 5", and i 7 5 F", so that dp(u) > d— Vd for all u G 5; and if 
w € 0\F then ds(w) < Vd (again by definition), so that dg\ s (w) > d — Vd 
for all w G O \ F. 



The proof of Lemma 17.51 involves a two-stage procedure. Stage 1 runs 
the algorithm described above with (F',£) as input. Stage 2 runs it with 
(P',0) as input and with the roles of £ and O reversed. By Claim [7T61 the 
quadruple (F, S, P, Q), where (F, S) is the output of Stage 1 and (P, Q) the 
output of Stage 2, is an approximating quadruple for A. 

Claim 7.7 The procedure described above has at most 2°^ gl ° sd ^^^ outputs 
as the input runs over those A G T~L for which (F' , P') is a covering approx- 
imation. 

Taking W to be the set of all possible outputs of the algorithm, Lemma 17.51 
follows. 

Proof of Claim \77?\ The output of Stage 1 of the algorithm is determined 
by the set of u's whose neighbourhoods are added to F' in Step 1, and the 
set of u;'s whose neighbourhoods are removed from S" in Step 2. 
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Each iteration in Step 1 removes at least yfd vertices from G, so there 
are at most g/Vd iterations. The u's in Step 1 are all drawn from A and 
hence N(F'), a set of size at most dg. So the total number of outputs for 
Step 1 is at most 

V ( dg )= 2 0(glosd/yq \ 

i<g/Vd ^ 1 ' 

We perform a similar analysis on Step 2. Each u G S" \ A contributes 
more than d — y/d edges to V(G), so initially |S"\.A| < gd/(d — V~d) = O(g). 
Each w used in Step 2 reduces this by at least y/d, so there are at most 
0{g/\fd) iterations. Each w is drawn from N(S"), a set which is contained 
in the fourth neighbourhood of F' (S" C N(G) by construction of S", G = 
N(A) and A C N(F')) and so has size at most d g. So as with Step 1, the 
total number of outputs for Step 2, and hence for Stage 1, is 2 0( - 9logd ^^\ 

Noting that h < g, a similar analysis applied to Stage 2 gives that that 
stage also has at most 2°( 9logc( / v/ ^ outputs, and the claim follows. 



7.2 Proof of Lemma 17. 3t Reconstruction 

We first note an important property of approximating quadruples. 
Lemma 7.8 If (F, S, P, Q) is an approximating quadruple for A S Ti then 

\S\ < \F\+0(g/Vd) (25) 
\Q\ < \P\ + 0(h/Vd). (26) 

Proof: Observe that \V(S, G)\ is bounded above by d\F\ + \fd\G \ F\ and 
below by d\A\ + (d - Vd)\S \A\ = d\S\ - Vd\S \ A\, giving 

\S\<\F\ + \(G\F)U(S\A)\/Vd, 

and that each u G {G \ F) U (S \ A) contributes more than d — \fd edges to 
V(G), a set of size gd, giving 

\(G\F)U(S\A)\<2gd/(d-Vd) = 0(g). 

These two observations together give (|25|) , The proof of (I26p is similar. 

I 

Lemma 17.31 now follows from 
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Lemma 7.9 For each (F, S, P, Q) G 2° x 2 s x 2 £ x 2° satisfying and 
I2fi\) . there are at most 29~ b - n (9/iogd) A's inU satisfying (ESP and [21\). 

Proof: For A £ T~L, write 

[A] = {ue£:N(u) CAT(,4)}, 

and write a' for \[A]\. Note that although G does not determine A, it 
does determine [A] . By Lemma 16.11 there is an absolute constant 7 > 
(independent of a' ,g,b and h) such that 

g — a > 75 and h — b > jh. (27) 

Say that Q is tight if \Q\ < 6 + 7/1/ log d, and slack otherwise, and that 
5 is tight if I S\ < g — 75/(4 log d) and slack otherwise. 

We now describe a procedure which, for input (F, S, P, Q), produces an 
output A which satisfies (fTHj) and (f2T|h The procedure involves a sequence 
of choices, the nature of the choices depending on whether S and Q are tight 
or slack. 

We begin by identifying a subset D of A which can be specified relatively 
"cheaply": if Q is tight, we pick BCQ with \B\ = b and take D = N(B); 
if Q is slack, we simply take D = P (recalling that PC H Q A). 

If S is tight, we complete the specification of A by choosing A \ D C 
S \ D. If S" is slack, we first complete the specification of G by choosing 
G\F C iV(S) \ F. Note that in this case, ^ implies 

|G\F| < 75/(3 log d). (28) 

We then complete the specification of A by choosing A \ D C [A] \ Z) (noting 
that we do know L4] \D at this point). 

This procedure produces all possible A £ rl satisfying (fTHj) and (f2T|) (and 
more). Before bounding the number of outputs, we gather together some 
useful observations. 

From (|25p and (|26p we have 

\S\ = 0(g) and \Q\ = 0(h). (29) 
If Q is tight then there are at most 

i<7/i/logd V 1 ^ 1 / i<-yh/lo S d \ / 

< 2 0(7Vl°g<*)log(0(logd/7)) 

< 2 7h / 2 (30) 
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possibilities for D, and in this case \D\ = h; while if Q is slack there is just 
one possibility for D, and in this case (using (|26p ) 



|D| = |P| > \Q\-n(h/Vd) 

> b + jh/ log d-n(h/Vd) 

> b + 7/1/(2 log d). (31) 

If 5 is slack then (since |JV(S)\F| < d\S\ < O(dg); see ([29])) the number 
of possibilities for G \ F is at most 

< 2^ 1+0 ( 1 )^ 79 ^ 31ogd ^ log ^ ( <ilogd / 7 ^ 

i<79/(31ogd) V 1 J 

< 2™/ 2 . (32) 

We now bound the number of outputs of the procedure, considering 
separately the four cases determined by whether S and Q are slack or tight. 

If S and Q are both tight then the number of possibilities for A is at 
most 

2hh/2] + [g-jg/(4logd)-h] < 2 <?-79/(4 log d)-6- 7 /t/2 _ 

(The first term in the exponent on the left-hand side corresponds to the 
choice of D (using (|30|) ). and the second to the choice of A \ D (note that 
since S and Q are both tight, |5 \ D\ < g — 7^/(41ogd) — h). To get the 



right-hand side, we use the second part of ([2] 

If S is tight and Q is slack then the total is at most 

2[9-7£//(41ogd)-b-7V( 21 °S a! )], (34) 

(Here there is no choice for D, and the exponent corresponds to the choice 
of A\D (using (|3T|)U 

If Q is tight then \ [A] \ D\ = a' — h, so that if S is slack (and Q tight) 
then the number of possibilities for A is at most 

2 hh/2]+[yg/2]+la'-h] < 29-79/2-6-7^/2 _ (35) 

(The first term on the left-hand side corresponds to the choice of D (using 
([30]) . the second to the choice of G \ F (using ([32]) ) and the third to the 
choice of A \ D. On the right-hand side, we use both parts of (I27p .) 

Finally, if Q is slack then \[A]\ D\ < a' - b - 7/1/(2 log d) (see §2)), so 
that if S and Q are both slack the number of possibilities for A is at most 

2 [79/2]+[o'-b-7V(21ogd)] < 2 9-73/2-6-7/i/(2 log d)_ ( 3 g) 
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(The first term on the left-hand side corresponds to the choice of G \ F and 
the second to the choice of A \ D. The right-hand side uses the first part of 

(EH).) 

Noting that h < g, the lemma follows from (i33l). (fMl) . (1351) and (p6l) . 



8 Proof of Theorem Q 

We say that a nice A C £ is of type I if \A\ < d/2, of type II if d/2 < 
\A\ < d 2 and of type III otherwise. We consider the portions of the sum 
in Theorem 14.21 corresponding to type I, II and III A J s separately. 

If A is of type I, then by Lemma EES \N(A)\ > d\A\ -2\A\(\A\ -1). Note 
also that in this case, B(A) = 0. By Lemma [1.61 for each 2 < i < d/2, there 
are at most M2°^ il °^ < 2 d+0 ^ il °^ 2-linked subsets off of size i. So 

d/2 

A of type I i=2 

= e- Q(d) . (37) 

We do something similar if A is of type II. Here Lemma 16.21 gives 
\N(A)\ > Cl(d)\A\ and \B(A)\ < 0{l/d)\A\ (recalling that N(B) C A), 
and so 

2 -\n(A)\+\b(A)\ < V" 2 d+ °( ilo z d ^ Q ( d ^ +0 ( 1 / d ') i 
A of type II i=d/2 

= e~ n( - d \ (38) 

We partition the set of j4's of type III according to the sizes of A, 
N(A),B(A) and H(A) (= N{B(A))) and use LemmaOto bound the sizes 
of the partition classes. In this case we have |iV(A)| > d 2 . So (summing 
only over those values of a, g, b and h for which H(a, g, b,h) ^ and g > d 2 , 
and with the inequalities justified below) 

£ 2-1^)1+1*04)1 = £ \n(a,g,b,h)\2-v +b 
A of type III a,,g,b,h 

< M 2- Q( - 9/losd) (39) 

a,g,b,h 
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< M 4 2~ n(s/logd) (40) 

g>d? 

< (M 4 /(l - 2- Q ( 1 /iogd))2^ 2 -n{d 2 /\ og d) 

= e~ n ^. (41) 

Here (|39|) is from Lemma [7.1l and in (|4U|) we use the fact that there are fewer 
than M choices for each of a, b and h. 

Combining ([371), & and (j4Tj) . we have Theorem 021 



9 Remarks 

The point of departure for our proof of Theorem 11.41 is Lemma 11.91 which 
allows us to focus immediately on those homomorphisms which are predom- 
inantly single-valued on one side of the cube. The proof of this lemma given 
in [7J relies heavily on the structure of the cube (in particular on the fact 
that the neighbourhoods of adjacent vertices induce a perfect matching), 
and it does not seem obvious at the moment how to get beyond this and 
generalize Theorem 11.41 to a larger class of graphs. 

On the other hand, the proofs of Theorem 14.21 and Lemma ED are much 
less dependent on the specific structure of the cube, using only the isoperi- 
metric bounds of Section [6j As such, it should be possible to extend these 
results considerably. To illustrate this, it is worth comparing Lemma 17.11 
with the main lemma of [14J. To state that, we need some notation. Let 
G be a ci-regular bipartite graph with bounded co-degree (i.e., every pair of 
vertices has a bounded number of common neighbours). Write X and Y for 
the bipartition classes of G. For any a' and g, set 

G(a',g) = {A<Z X: A 2-linked, \N(A)\ = g,\[A]\ < a'}, 

(recall that [A] = {x £ X:N(x) C N(A)}), and set 5 = (g - a')/g. Using 
slightly more versatile notions of approximation than those introduced in 
Section [7J the following is proved in [14J: 

Theorem 9.1 For d sufficiently large, and for any a' and g satisfying 1 > 
5 > log 9 d/d 2 , 

\G(a',g)\ < iX^-VCeiogd)). 

Notice that (by the results of Section [6]) the sum in Theorem 14.21 is ex- 
tending only over sets A which satisfy (|iV(^4)| — |^4|)/|iV(^4)| > 17(1), a much 
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stronger condition than that imposed in Theorem l9.11 By slightly modifying 
our notions of approximation, we may extend the validity of Lemma l7.1l to 
cover a similar range as Theorem 19.11 However, the analysis is considerably 
more involved, and we do not do so here. 

Acknowledgement: The author thanks Jeff Kahn for numerous helpful 
discussions. 
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